[57] ABSTRACT

A method and apparatus for hierarchical character recognition processing of ambiguous and noisy characters which produces highly reliable results at high levels of hierarchical processing. The invention first applies a universal classifier system (which may comprise one or more universal classifiers) to input image data, and identifies "suspicious" characters. The image data for suspicious characters is then applied to a "specialist" classifier that is designed to handle only a narrow and well-defined set of recognition cases. This hierarchical processing architecture and method results in increased accuracy of recognition. The method is particularly applicable to [handwritten] and noisy machine-printed characters.

3 Claims, 2 Drawing Sheets 
similar characters. Unfortunately, similarity exists even between characters of different classes sufficient to create ambiguities in the recognition of such classes. For example, "I", "1", and "l" have rather similar shapes to a recognition system, as do "S" and "5", "4" and "9", "3" and "8", and several other "ambiguity" classes. The correct identification of such ambiguous characters requires extensive recognition capabilities or the presence of context. The larger the set of characters, the more ambiguity classes it possesses. A universal classifier that is designed to recognize a full set of characters (such as all alphabetic characters and/or all numeric characters) is regularly overwhelmed by morphologically similar classes of characters that humans normally disambiguate [illegible].

To overcome this problem with universal classifiers, the invention employs a hierarchy of "specialist" classifiers, each configured to recognize characters belonging to distinct ambiguity classes. That is, each special classifier is trained or built using known principles to distinguish only the differences between characters in an ambiguity class (e.g., "4" and "9"); the special classifiers are not designed to process characters of other shapes. A classifier of this kind is trained (built) only on a large set of characters that belong to a specific ambiguity class.

The specialist classifiers may be implemented in any desired fashion, using, for example, feature extraction algorithms such as neural networks and syntactic or linguistic algorithms, nearest neighbor algorithms, and other algorithms known in the art of character recognition. The distinctive feature of the invention is that the full power of such methods is brought to bear on a specific ambiguity class (e.g., "3" and "8", etc.).

EXAMPLE EMBODIMENT

FIG. 4 is a block diagram showing the basic architecture of the invention. Image data 3 is applied to a universal classifier system 8, which can comprise one or more universal classifiers of the types known in the prior art. For unambiguous data, the universal classifier system 8 outputs a character code 6.

However, added to the universal classifier system 8 is the ability to "call" a "specialist" classifier 9 trained or built to recognize [illegible] classifier system 8. Any particular specialist classifier 9 is selected based upon the probable identity of a candidate character, as determined by the universal classifier system 8, and whether the candidate character is "suspicious". A suspicious character may be determined based upon any desired criteria, such as apparent size, the type of character, level of gray in the image of the character, styles of handwriting, prior knowledge that a character candidate has been "surgically" separated from an adjoining character, or upon assignment of a character candidate by the universal classifier system to a predefined character group known to be ambiguous (e.g., the pair "4" and "9"; the group "I", "1", "l", etc.). The "called" specialist classifier 9 analyzes the image data by performing a recognition algorithm [illegible] the candidate character and then outputs a probable character code 6.

FIG. 5 is a flow chart showing one embodiment of the invention. One or more universal classifiers 8 are applied to image data 3 to generate a probable character (step 100). A determination is then made as to whether the character is "suspicious". For example, the character may have been recognized by a universal classifier system 8 as probably being a "4". The universal classifier system 8 can be [programmed], for example, to always [treat a "4" as a suspicious] character because it is often mistaken for a "9" (particularly with handwritten characters). Thus, the candidate character is part of the ambiguity class containing "4" and "9".

If the character is not suspicious, then the probable character determined by the universal classifier system 8 is output as a character code 6 (step 104). If the character is suspicious (step 102), a specialist classifier 9 for the suspicious character is selected (step [illegible]). The selected specialist classifier 9 is then applied to the image data 3 (step [illegible]) and determines the most probable character to be assigned to the image data 3. That character is then output as a character code 6 (step [illegible]). Note that the character determined by the specialist classifier may be the same character determined as being most probable by the universal classifier system 8.

Implementation

The invention may be implemented in hardware (digital, analog, or hybrid digital-analog) or software, or a combination of both. However, preferably, the invention is implemented in computer programs executing on programmable computers each comprising at least one processor, a data storage system (including volatile and non-volatile memory and/or storage elements), at least one input device, and at least one output device. Program code is applied to input data to perform the functions described herein and generate output information. The output information is applied to one or more output devices, in known fashion.

Each program is preferably implemented in a high level procedural or object oriented programming language to communicate with a computer system. However, the programs can be implemented in assembly or machine language, if desired. In any case, the language may be a compiled or interpreted language.

Each such computer program is preferably stored on a storage media or device (e.g., ROM or magnetic diskette) readable by a general or special purpose programmable computer, for configuring and operating the computer when the storage media or device is read by the computer to perform the procedures described herein. The inventive system may also be considered to be implemented as a computer-readable storage medium, configured with a computer program, where the storage medium so configured causes a computer to operate in a specific and predefined manner to perform the functions described herein.

In one implementation of the invention, use of specialist classifiers improved the error rate by about 40% compared to the same system without specialist classifiers.

A number of embodiments of the invention have been described. Nevertheless, it will be understood that various modifications may be made without departing from the spirit and scope of the invention. Accordingly, it is to be understood that the invention is not to be limited by the specific illustrated embodiment, but only by the scope of the appended claims.
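
As an illustration only, the flow of FIG. 5 described above might be sketched in Python as follows. This is a minimal sketch under stated assumptions, not the implementation of the invention: the names `HierarchicalRecognizer`, `toy_universal`, and `toy_specialist_4_9`, the dictionary used as image data, and the choice of "membership in a predefined ambiguity group" as the only suspicion criterion are all hypothetical. The text permits many other criteria (apparent size, level of gray, and so on) and any classifier technology.

```python
from dataclasses import dataclass
from typing import Callable, Dict, FrozenSet, Optional

# Hypothetical types: image data is a plain dict of toy features, and a
# classifier is any callable that maps image data to a character code.
Image = dict
Classifier = Callable[[Image], str]


@dataclass
class HierarchicalRecognizer:
    """Universal classifier plus specialists keyed by ambiguity class."""
    universal: Classifier
    specialists: Dict[FrozenSet[str], Classifier]

    def ambiguity_class(self, char: str) -> Optional[FrozenSet[str]]:
        # Assumed suspicion criterion: the probable character belongs to
        # a predefined group known to be ambiguous (e.g. "4" and "9").
        for group in self.specialists:
            if char in group:
                return group
        return None

    def recognize(self, image: Image) -> str:
        probable = self.universal(image)        # generate probable character
        group = self.ambiguity_class(probable)  # is the character suspicious?
        if group is None:
            return probable                     # not suspicious: output as is
        specialist = self.specialists[group]    # select the specialist
        return specialist(image)                # specialist decides the code


def toy_universal(image: Image) -> str:
    # Toy stand-in: cannot tell "4" from "9", so it always answers "4"
    # for any loop-and-stem shape.
    return "4" if image.get("shape") == "loop_stem" else "7"


def toy_specialist_4_9(image: Image) -> str:
    # Toy stand-in for a specialist that only separates "4" from "9".
    return "9" if image.get("closed_top") else "4"


recognizer = HierarchicalRecognizer(
    universal=toy_universal,
    specialists={frozenset({"4", "9"}): toy_specialist_4_9},
)
```

With these toy classifiers, `recognizer.recognize({"shape": "loop_stem", "closed_top": True})` is routed to the specialist, while `recognizer.recognize({"shape": "bar"})` is answered by the universal classifier alone. The specialist may also return the same character the universal classifier proposed, as the description notes.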